The Self-Extending Phrasal Lexicon

Authors

  • Uri Zernik
  • Michael G. Dyer
Abstract

Lexical representation so far has not been extensively investigated in regard to language acquisition. Existing computational linguistic systems assume that text analysis and generation take place in conditions of complete lexical knowledge. That is, no unknown elements are encountered in processing text. It turns out, however, that productive as well as non-productive word combinations require adequate consideration. Thus, assuming the existence of a complete lexicon at the outset is unrealistic, especially when considering such word combinations.

Similar resources

An Expert Lexicon Approach to Identifying English Phrasal Verbs

Phrasal Verbs are an important feature of the English language. Properly identifying them provides the basis for an English parser to decode the related structures. Phrasal verbs have been a challenge to Natural Language Processing (NLP) because they sit at the borderline between lexicon and syntax. Traditional NLP frameworks that separate the lexicon module from the parser make it difficult to...

Exploiting Phrasal Lexica and Additional Morpho-syntactic Language Resources for Statistical Machine Translation with Scarce Training Data

In this work, the use of a phrasal lexicon for statistical machine translation is proposed, and the relation between data acquisition costs and translation quality for different types and sizes of language resources has been analyzed. The language pairs are Spanish-English and Catalan-English, and the translation is performed in all directions. The phrasal lexicon is used to increase as well as...

Tone and accent in Saramaccan: Charting a deep split in the phonology of a language

Saramaccan, an Atlantic creole spoken in Surinam, has traditionally been analyzed as exhibiting a high-tone/low-tone opposition in its lexicon. However, while it is true that part of its lexicon exhibits a robust high/low opposition, the majority of its words are marked not for tone but pitch accent. The Saramaccan lexicon, therefore, is split with some words being marked for tone and other wor...

Principled Induction of Phrasal Bilexica

We aim to replace the long and complicated pipeline employed to produce probabilistic phrasal bilexica with a theoretically principled, grammar-based approach. To this end, we introduce a learning regime to learn a phrasal grammar equivalent to linear transduction grammars. The stochastic version of this new grammar type also has the property that the set of biterminals constitutes a natural p...

Phrasal verbs between syntax and lexicon

Phrasal verbs have some structural and semantic characteristics in common with morphologically complex words, even though they originate from phrasal constructions. Focusing on the role played by lexicalization and grammaticalization processes in the gradual shift from syntactic to morphological structures, this paper deals with semantic and morphotactic characteristics of Italian phrasal verbs...

Journal:
  • Computational Linguistics

Volume 13

Publication date: 1987